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NON-UNIQUE GAMES OVER COMPACT GROUPS AND ORIENTATION 

ESTIMATION IN CRYO-EM 

AFONSO S. BANDEIRA, YUTONG CHEN, AND AMIT SINGER 


Abstract. Let ^ be a compact group and let fij € L^{Q). We define the Non-Unique Games 
(NUG) problem as finding gi,... ,gn £ C/ to minimize (di9j~^)- We devise a relaxation 

of the NUG problem to a semidefinite program (SDP) by taking the Fourier transform of fij over 
Q, which can then be solved efficiently. The NUG framework can be seen as a generalization 
of the little Grothendieck problem over the orthogonal group and the Unique Games problem 
and includes many practically relevant problems, such as the maximum likelihood estimator to 
registering bandlimited functions over the unit sphere in d-dimensions and orientation estimation 
in cryo-Electron Microscopy. 


1. Introduction 


We consider problems of the following form 


minimize 

9i,—,gn 


X [didj 
i,j=^ 

subject to Qi £ G, 


-1 


( 1 . 1 ) 


where ^ is a compact group and fij : ^ M are suitable functions. We will refer to such problems 
as a Non-Unique Game (NUG) problem over Q. 

Note that the solution to the NUG problem is not unique. If gi,gn is a solution to (1.1), 


then so is gig ,..., gng for any g ^ Q. That is, we can solve (1.1) up to a global shift g ^ Q. 

Many inverse problems can be solved as instances of (1.1). A simple example is angular syn¬ 
chronization [SHE], where one is tasked with estimating angles {9i}i from information about their 
offsets 9i — 9j mod 27r. The problem of estimating the angles can then be formulated as an opti¬ 


mization problem depending on the offsets, and thus be written in the form of (1.1). In general. 


many inverse problems, where the goal is to estimate multiple group elements from information 


about group offsets, can be formulated as (1.1). 

One of the simplest instances of 0 is the Max-Cut problem, where the objective is to partition 
the vertices of a graph as to maximize the number of edges (the cut) between the two sets. In this 
case, G = Z 2 , the group of two elements {±1}, and fij is zero if {i,j) is not an edge of the graph 
and 

fij{l) = 0 and fij{-l) = -1, 


if {i,j) is an edge. In fact, we take a semidefinite programming based approach towards (1.1) that is 
inspired by — and can be seen as a generalization of— the semidefinite relaxation for the Max-Cut 
problem by Goemans and Williamson im. 

Another important source of inspiration is the semidefinite relaxation of Max-2-Lin(Zj;,), pro¬ 
posed in for the Unique Games problem, a central problem in theoretical computer sci¬ 
ence [201 [2l]. Given integers n and L, an Unique-Games instance is a system of linear equations 
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over 'Ll on n variables {xiYl^^. Each equation constraints the difference of two variables. More 
precisely, for each {i,j) in a subset of the pairs, we associate a constraint 

Xi — Xj = bij mod L. 

The objective is then to hnd {xi}^^^ in Z^, that satisfy as many equations as possible. This can be 
easily described within our framework by taking, for each constraint, 

fijis) = ~^g=bij, 

and fij = 0 for pairs not corresponding to constraints. The term “unique” derives from the fact 
that the constraints have this special structure where the offset can only take one value to satisfy 
the constraint, and all other values have the same score. This motivated our choice of nomecluture 
for the framework treated in this paper. The semidefinite relaxation for the unique games problem 
proposed in m was investigated in [7] in the context of the signal alignment problem, where the 
fij are not forced to have a special structure (but Q = Ll)- The NUG framework presented in this 
paper can be seen as a generalization of the approach in [7j to other compact groups Q. 

Besides the signal alignment treated in [7] the semidehnite relaxation to the NUG problem we 
develop coincides with other effective relaxations. When ^ = Z 2 it coincides with the semidefinite 
relaxations for Max-Cut m, little Grothendieck problem over Z 2 [3125], recovery in the stochastic 
block model [21 [5] j and Synchronization over Z 2 (HOlIlj. When Q = 50(2) and the functions fij 
are linear with respect to the representation pi : 50(2) —>• C given by the pi{8) = e*^, it coincides 
with the semidehnite relaxation for angular synchronization m- Similarly, when Q = 0{d) and the 
functions are linear with respect to the natural d-dimensional representation, then the NUG problem 
essentially coincides with the little Grothendieck problem over the orthogonal group [3 [23]. Other 
examples include the shape matching problem in computer graphics for which ^ is a permutation 
group (see mm)- 

1 . 1 . Orientation estimation in cryo-Electron Microscopy. A particularly important appli¬ 
cation of this framework is the orientation estimation problem in cryo-Electron Microscopy |28) . 




000606 
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Eigure 1.1. Illustration of the cryo-EM imaging process: A molecule is imaged af¬ 
ter being frozen at a random (unknown) rotation and a tomographic 2-dimensional 
projection is captured. Given a number of tomographic projections taken at un¬ 
known rotations, we are interested in determining such rotations with the objective 
of reconstructing the molecule density. Images courtesy of Amit Singer and Yoel 
Shkolnisky |28|. 
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Cryo-EM is a technique used to determine the 3-dimensional structure of biological macro¬ 
molecules. The molecules are rapidly frozen in a thin layer of ice and imaged with an electron 
microscope, which gives noisy 2-dimensional projections. One of the main difficulties with this 
imaging process is that these molecules are imaged at different unknown orientations in the sheet 
of ice and each molecule can only be imaged once (due to the destructive nature of the imaging 
process). More precisely, each measurement consists of a tomographic projection of a rotated (by 
an unknown rotation) copy of the molecule. The task is then to reconstruct the molecule density 
from many such noisy measurements. In Sectionwe describe how this problem can be formulated 
in the form (1.1). 


2. Multireference Alignment 


In classical linear inverse problems, one is tasked with recovering an unknown element x € X 
from a noisy measurement of the form V{x) -|- e, where e represents the measurement error and V 
is a linear observation operator. There are, however, many problems where an additional difficulty 
is present; one class of such problems includes non-linear inverse problems in which an unknown 
transformation acts on x prior to the linear measurement. Specifically, let X he a vector space and 
^ be a group acting on X. Suppose we have n measurements of the form 

Vi = 'P{giO x) + ei, i = (2.1) 


where 


• X is a fixed but unknown element of X, 

• gi,... ,gn are unknown elements of Q, 

• o is the action of ^ on A, 

• "P : A —7- y is a linear operator, 

• Y is the (finite-dimensional) measurement space, 

• Cj’s are independent noise terms. 


If the 5 i’s were known, then the task of recovering x would reduce to a classical linear inverse 
problem, for which many effective techniques exist. For this reason, we focus on the problem of 
estimating gi,...,gn. 

There are several common approaches for inverse problems of the form (2.1). One is motivated 
by the observation that estimating x knowing the gi’s and estimating the gi's knowing x are both 
considerably easier tasks. This suggests a alternating minimization approach where each estimation 
is updated iteratively. Besides a lack of theoretical guarantees and a tendency to stall at local 
optima, these kind of approaches usually start with an initial guess for x and this can introduce 
model bias (c.f. the experiment in Figure 2.2). Another approach, which we refer to as pairwise 
comparisons m, consists in determining, from pairs of observations {yi,yj), the most likely value 
for gigj^■ Although the problem of estimating the giS from these pairwise guesses is fairly well- 
understood [271 uni EH] enjoying efficient algorithms and performance guarantees, this method 
suffers from loss of information as not all of the information of the problem is captured in this most 
likely value for gtgj^ and thus this approach tends to fail at low signal-to-noise-ratio. 

In contrast, the Maximum Likelihood Estimator (MLE) leverages all information and enjoys many 
theoretical guarantees. Assuming that the e^’s are i.i.d. Gaussian, the MLE for the observation 
model (2.1) is given by the following optimization problem: 


minimize 
subject to 


i=l 

9i ^ G 
x€X 


( 2 . 2 ) 
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We refer to (2.2) as the Multireference Alignment (MRA) problem. Unfortunately, the exponen¬ 
tially large search space and nonconvex nature of (2.2) often render it computationaly intractable. 
However, for several problems of interest, we formulate (2.2) as an instance of an NUG for which 
we develop efficient approximations. 


2.1. Registration of signals on the sphere. Consider the problem of estimating a bandlimited 
signal on the circle x : —)• C from noisy rotated copies of it. In this problem, X = span 

is the space of bandlimited functions up to degree t on 5^, ^ = SO{2) and the group action is 

gox= 

k=—t 

where x € X and we identihed g G 50(2) with 9g G [0, 27r]. 

The measurements are of the form 

yi:=V{giOx) + ei, i = l,...,n 

where 

• X G T, 

• gi G 50(2), 

• V : X ^ samples the function at L equally spaced points in 5^, 

• Ci ~ M{0, (t'^Ilxl) (i = 1,... , n) are independent Gaussians. 

Our objective is to estimate gi,... ,gn and x. Since estimating x knowing the group elements gi 
is considerably easier, we will focus on estimating gi,... ,gn- As shown below, this will essentially 
reduce to the problem of aligning (or registering) the observations yi,... ,yn- 
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Figure 2.1. Illustration of the registration problem in 5^. The first column consists 
of a noiseless example, the second column represents an instance for which the 
template is known and matched filtering is effective. However, in the examples we 
are interested in the template is unknown (last two columns) rendering the problem 
significantly harder. 


In absence of noise, the problem of finding the gfs is trivial (cf. first column of Figure [2T| ). With 
noise, if x is known (as it is in some applications), then the problem of determining the gi's can be 
solved by matched filtering (cf. second column of Figure 2.1). However, x is unknown in general. 
This, together with the high levels of noise, render the problem significantly more difficult (cf. last 
two columns of Figure |2.1[ ) . 

In fact, in the high noise regime, if one attempts to perform matched filtering with a reference 
signal yj that is not the true template x (as this is unknown), then there is a high risk of model bias: 
the reconstructed signal x tends to capture characteristics of the reference yj that are not present 
in the actual original signal x (see Figure [2^. This issue is well known among the biomedical 
imaging community (see [13] for a recent discussion). As Figure 2.2 suggests, the methods treated 
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Figure 2.2. A simple experiment to illustrate the model bias phenomenon: given 
a picture of the mathematician Hermann Weyl (second picture of the top row) we 
generate many images consisting of random rotations (we considered a discretization 
of the rotations of the plane) of the image with added gaussian noise. An example 
of one such measurements is the third image in the first row. We then proceeded 
to align these images to a reference consisting of a famous image of Albert Einstein 
(often used in the model bias discussions). After alignment, an estimator of the 
original image was constructed by averaging the aligned measurements. The result, 
first image on second row, is clearly closer to the image of Einstein than the one of 
Weyl, illustrating the model bias issue. On the other hand, the method proposed 
in [7] and generalized here produces the second image of the second row, which 
shows no signs of suffering from model bias. As a benchmark, we also include the 
reconstruction obtained by an oracle that is given the true rotations (third image in 
the second row). 


in this paper do not suffer from model bias as they do not use any information besides the data 
itself. 

We now define the problem of registration in d-dimensions in general. X = span{pfc}fcg_ 4 j is the 
space of bandlimited functions up to degree t on S'^ where the p^s are orthonormal polynomials 
on S'^, At indexes all pk up to degree t and Q = SO{d + 1). 

The measurements are of the form 


Vi :='P{giO x) + et, i = 


(2.3) 


where 

• X G X , 

• gi G SO{d + 1), 

• V : X samples the function on L points in 

• Cj ~ AA(0, (f = 1, ■.., n) are independent Gaussians. 

Again, our objective is to estimate gi,... ,gn and x. 
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Figure 2.3. An illustraction of the registration in 2-dimensions. The left four 
spheres provide examples of clean signals yi and the right four spheres are of noisy 
observations. Note that the images are generated using a quantization of the sphere. 


The MRA solution for registration in d-dimensions is given by 


minimize 

gi,...,gn,x 

subject to 


'^Wvi - 'Pidi ° x)\\l 

i=l 

Qi G SOid + 1) 

X £ X 


Let us remove x from (2.4). Let 


Q: 


A 


L 

yi i-A ^ 2/i(0 sinc(a; - uji), 
1=1 


(2.4) 


where {w/} C S'^ are the points sampled by V and sine is the multidimensional sine function. We 
define the group action • on the yi through its action on X by 


The group action of Q 


9 ^ ■yi-=P{9 ^ o QiVi)) ■ 


SO{d + 1) preserves the norm || • || 2 . 


Then (2.4) is equivalent to 


minimize 
subject to 


||<7i Pyi-V{x)\\^ 

i=l 


Qi G SO{d + 1) 


X G . 


(2.5) 


Recall that X is the space of bandlimited functions. If V samples x at sufficiently many well 
spread-out points (i.e. take a large L), then ||y~^yi ||2 = ||yi ||2 and ||'P(x )||2 can be easily estimated 
and thus considered approximately fixed. We approximate ( |2.5[ ) with 

n 

maximize (g~^ ■yi,V(x)) 

gi,...,g„,x 

2=1 

subject to gi G SO{d -|- 1). 


( 2 . 6 ) 
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For fixed gi^s, the maximizing x must satisfy 'P(x) = - ■ Vi- Then (2.6) is equivalent to 


maximize 

9li'“)9n 


2=1 




(2.7) 


subject to gi G SO{d + 1). 

We simplify the objective function in (|2.7|) and get the equivalent problem 


maximize 9igj ^ ' Vj 

subject to gi G SO{d + 1). 


( 2 . 8 ) 


Again, we use the observation that ||yi|| 2 ’s are approximately fixed. Then, (2.8) is equivalent to 

2 

2 (2.9) 


minimize 

pi vjpn 


E 


Ivi - 9i9j ^ ■ Vj 


hi=i 

subject to gi G SO{d + 1). 


In summary, (2.4) can be approximated by (|2.9|), which is an instance of (1.1) 


2.2. Orientation estimation in cryo-EM. The task here is to reconstruct the molecule density 
from many such measurements (see the second column of Figure o for an idealized density and 
measurement dataset). The linear inverse problem of recovering the molecule density given the 
rotations fits in the framework of classical computerized tomography for which effective methods 
exist. Thus, we focus on the non-linear inverse problem of estimating the unknown rotations and 
the underlying density. 



Figure 2.4. Sample images from the E. coli 50S ribosomal subunit, generously 
provided by Fred Sigworth at the Yale Medical School. 


An added difficulty is the high level of noise in the images. In fact, it is already non-trivial to 
distinguish whether a molecule is present in an image or if the image consists only of noise (see 
Figure 2.4 for a subset of an experimental dataset). On the other hand, these datasets consist of 
many projection images which renders reconstruction possible. 

We formulate the problem of orientation estimation in cryo-EM. Let X to be the space of ban- 
dlimited functions that are also essentially compactly supported in and G = SO(3). The 
measurements are of the form 


Ii{x,y) :='P{RiO (j)) + ei, i = l,...,n (2.10) 

• ()) G A, 

• R^ G SO(3), 

• 'P((?!>) samples 4){x,y, z)dz {V is called the discrete X-ray transform), 

• Cj’s are i.i.d Gaussians representing noise. 
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Our objective is to find gi,... ,gn and (j). 

The operator V in the orientation estimation problem is different than in the registration problem. 
Specifically, V is a composition of tomographic projection and sampling, thereby rendering the MLE 
more involved. To write the MLE solution for the orientaiton estimation problem, we will use the 
Fourier slice theorem |24j . 

The Fourier slice theorem states that the 2-dimensional Eourier transform of a tomographic 
projection of a molecule density (p coincides with the restriction to a plane normal to the projection 


direction, a slice, of the 3-dimensional Fourier transform of the density p. See Figure 2.5 
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Figure 2.5. An illustration of the use of the Fourier slice theorem and the common 
lines approach to the orientation estimation problem in cryo-EM. 


Let ii{r,9) be the Fourier transform of li in polar coordinates. We embed li and Ij in M^, and 
apply g~^ and g~^ to li and Ij, respectively. Then, the directions of the lines of intersection on Ij 
and Ij are given, respectively, by unit vectors 


cij (gigj 


9 i{ 9 .. 


-1 


63 X 5 


-1 


63 ) 


9 i{ 9 i ^ • 63 X 5 . ^ • 63 




9 j {9 


-1 

j 


63 X g. 


-1 


63) 


63 X gigj ^ ■ 63 
|e 3 X gigj^ ■ 63] 


-1 


63 X [gigj I • 63 


-1 


9 j{ 9 j ^ • 63 X 5/ • 63)) 


|63 X [gig 


-1 


-1 


where 63 := (0,0,1)^. See [28] for details. 

We seek the MRA solution on the lines of intersection. 


minimize 

9li-“i9n 


Fib 

*J=1 

subject to gi G SO{3), 


-1 


C-ij [ 9 i 9 j 


-Ir 


i I ■) Cji \ 9 i 9 j 


-I 


63 


and (2.13) is an instance of (1.1). 


( 2 . 11 ) 


( 2 . 12 ) 


(2.13) 


Note that for n = 2 images, there is always a degree of freedom along the line of intersection. 
In otherwords, we cannot recover the true orientation between Ii and F- However, for n > 3, 
this degree of freedom is eliminated. In general, the measurement system suffers from a handness 
ambuiguity on the reconstruction (see, for example, |2H|), this will be discussed in detail in a future 
version of this manuscript. It is also worth mentioning several important references in the context 
of angular reconstitution 
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3. Linearization via Fourier expansion 
Let us consider the objective function in the general form 

n 

hj {rngf) ■ (3.1) 

*.i=i 

Note that each fij in (0 can be nonlinear and nonconvex. However, since Q is compact (and 
since fij G Lf‘{Q)), we can expand, each fij in Fourier series. More precisely, given the unitary 
irreducible representations {pk} of Q, we can write 



^4tr fij{k)pk i^giPj^ 
k=0 
oo 

^4tr fij{k)pk{gi)pU9j) 


k=0 


(3.2) 


where fij{k) are the Fourier coefficients of fij and can be computed from fij via the Fourier 
transform 


/u(^) := / fij{9)Pk{g )dg 

Jg 

= [ fij{9)pk{9)dg. 

Jg 

Above, dg denotes the Haar measure on Q and 4 the dimension of the representation pk- 
We express the objective function (3.1) as 


(3.3) 


Y \ 9i9j ^) = X] [fij{k)pk{gi)p*k{gj 

i,j=l k=0 

oo n 

= YY^’^^^ fijik)pk{9i)pk{9j 

k=0 i,j=l 


which is linear in Pk{gi)p*k{gj)- This motivates writing (1.1) as linear optimization over the variables 

:= 


Pk{gi) 


Pk{9l) 

Pk{Pn ) 


Pkig-n) 


In other words, 


n 


oo 




9i9. ^ I ^ 


^tr 


*d=i 

where the coefficient matrices are given by 

k=0 


:= dk 

1_ 

fi2{k) • 
f22{k) • 

• flnik)' 

• hnik) 


Jnlik) 

fn2ik) • 

■ fnn{k)_ 
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We refer to the dk x block of corresponding to Pk{9i)p*k{gj) = PkiSidj ■1) as Xip. We 
now turn our attention to constraints on the variables It is easy to see that: 

xW ^0,Vfc 


rank 


x(^) 


— dk ) ) 


X 


(k) 


Constraints (|3.4l), (13.5^ and (13.6^ ensure X*^^) is of the form 


(3.4) 

(3.5) 

(3.6) 

(3.7) 


x(^) = 

1 - 

1 _ 


1 - 

1 _ 


v{k) 

-Xn 


X-n 


(k) A -K (k) 

for some X) ' unitary x matrices. The constraint (3.7) attempts to ensure that X> ' is in 
the image of the representation of G. Notably, none of these constraints ensures that, for different 


(k) — I 

values of k, Xb correspond to the same group element piPj . Adding such a constraint would 
yield 

OO 

minimize tr 

k=0 

subject to X^^^ ^ 0 

= l 

x(^) 


— r, 


(3.8) 


rank 


X, 


(fc) 

'if 


— dk 

.-ll 


= PkidiPj ) for some pip- ^ G 


Unfortunately, both the rank constraint and the last constraint in (3.8) are, in general, nonconvex. 
We will relax (3.8) by dropping the rank requirement and replacing the last constraint by positivity 
constraints that couple different X^^^^’s. We achieve this by considering the Dirac delta funcion on 
G- Notice that the Dirac delta funcion 5{p) on the identity e G ^ can be expanded as 

OO 

%)=E dfc tr \5{k)pk{p) 


/c=0 

OO 


: X, tr 


fc=o 

OO 


6{h)pl{h)dh pkip) 


X^ktr [pk{p)] ■ 


k=0 

If we replace p with p~^ (^9i9j^^, then we get 


OO 

S{9~^9ij) =X]4tr pkig~^)pk 


k=0 

OO 


y^^dfctr pl{g)x[f 


fc=0 
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This means that, by the definition of the Dirac delta, we can require that 


^4tr pl{g)x\f >0 V5 G a, 


fc =0 


^4tr pl{g)X. 


^(k) 


dg = 1. 


(3.9) 

(3.10) 


This suggests relaxing (3.8) to 


minimize 

xw 


^tr 


k=0 


subject to ^ 0 


\^(^) _ T 

dj^ X df^ 


ii 

00 


^4tr pl{g)x\f 


/c=0 


>0 \/g^Q 


(3.11) 


^4tr ypl{g)X\.’\^ 4 = 1. 

k=o / 

For a nontrivial irreducible representation p^, we have Jg Pk{g)dg = 0. This means that the 
integral constraint in (3.11) is equivalent to the contraint 


x\f = l,Vi 1 . 


Thus, we focus on the optimization problem 


minimize 

xw 


^tr 


k=0 


subject to X^^'^ ^0 

v(^) —T 
ii dfg X df^ 


(3.12) 


'^dkii pl{g)xlf >0 yg£G 


k=0 

= 1- 

When ^ is a finite group it has only a finite number of irreducible representations. This means 


that (3.12) is a semidefinite program and can be solved, to arbitrary precision, in polynomial 
time [32]. In fact, when Q = Z^, a suitable change of basis shows that (3.12) is equivalent to the 


semidefinite programming relaxation proposed in [7| for the signal alignment problem. 

Unfortunately, many of the applications of interest involve infinite groups. This creates two 


obstacles to solving (3.12). One is due to the infinite sum in the objective function and the other 


due to the infinite number of positivity constraints. In the next section, we address these two 
obstacles for the groups 50(2) and 50(3). 


4. Finite truncations for 50(2) and 50(3) via Fejer kernels 


The objective of this section is to replace (3.12) by an optimization problem depending only in 
finitely many variables X^^l. The objective function in (3.12) is converted from an infinite sum to 
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a finite sum by truncating at degree t. That is, we fix a t and set = 0 for k > t. This consists 


of truncating the Fourier series of fij [9^9j ) ■ Unfortunately, constraint (3.9) given by 


J2dktT pl{g)X^,-^]>0 ygeg, 


k=0 


(k) 

still involves infinitely many variables and consists of infinitely many linear constraints. 

We now address this issue for the groups SO{2) and 50(3). 

4.1. Truncation for 50(2). Since we truncated the objective function at degree t, it is then 
natural to truncate the infinite sum in constraint (3.9) also at t. The irreducible representations 
of 50(2) are and dk = 1 for all k. Let us identify g G 50(2) with 6g G [0,27r]. That 

straightforward truncation corresponds to approximating the Dirac delta with 


<5(9) E 




k=—t 


This approximation is known as the Dirichlet kernel, which we denote as 


Dt{e) := ^ 

k=—t 

However, the Dirichlet kernel does not inherit all the desirable properties of the delta function. In 
fact, Dt{0) is negative for some values of 9. 

Instead, we use the Fejer kernel, which is a non-negative kernel, to approximate the Dirac delta. 
The Fejer kernel is defined as 


1 


t-i 


r.io) E 1 


t 

k=0 k=—t 

which is the hrst-order Cesdro mean of the Dirichlet kernel. 
This motivates us to replace constraint (3.9) with 

k=—t 

(—k) 

where, for A: > 0, X^ denotes 
This suggests considering 


t 


Jke 


_ u ) >0 V0 G [0,27r], 


(fc) 


^^3 


minimize 

x(^) 


^tr 


k=0 


subject to E 0 

_ F 


E 5-i 

k=-t ^ 

xf = 1. 

(k) 

which only depends on the variables Xb ' for k = 0,... ,k. 


e-'^^^x\f >0 V0 G [0, 27r] 
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Unfortunately, the condition that the trigonometric polynomial Ylk=-t ~ ® is al¬ 

ways non-negative, still involves an infinite number of linear inequalities. Interestingly, due to the 
Fejer-Riesz factorization theorem (see US]), this condition can be replaced by an equivalent condi¬ 
tion involving a positive semidefinite matrix — it turns out that every nonnegative trigonometric 
polynomial is a square, meaning that the so called sum-of-squares relaxation [261 116) is exact. How¬ 
ever, while such a formulation would still be an SDP and thus solvable, up to arbitrary precision, 
in polynomial time, it would involve a positive semidefinite variable for every pair {i,j), rendering 
it computationally challenging. For this reason we relax the non-negativity constraint by asking 
that “ ■^) is non-negative in a finite set Qt £ SO{2). This yields the following 

optimization problem: 


minimize 

xW 

subject to 


^tr 

k=0 

xW ^ 0 


= L 


d^xd^ 


E 

k=—t 


1 _ \M\ > 0 


= 1 . 


yO G Hi 


(4.1) 


4.2. Truncation for S'0(3). The irreducible representations of 50(3) are the Wigner-D matrices 
and = 2A:-|-1. Let us associate g G 50(3) with Euler (Z-Y-Z) angle (a,/3,7) G 
[0,27r] X [0,7r] x [0, 27r]. A straightforward truncation yields the approximation 


d(5)~^(2fc + l)tr flT('^)(a,/3,7) 


A:=0 


Observe that the operator tr is invariant under conjugation. Then can be decomposed as 


lT(^)(a,/3,7) = RK^^'>{e)R* 


such that 

= 

It follows that 

tr 

The relationship between 6 and a, /3,7 is 

9 = 2 arccos 


^-ik9 


o'lkO 


= tr 


A(O(0)1 = ^ ^m9 


l=-k 


, ( Q: + 7 

cos I 2 j cos ( 2 
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This relationship can obtained by directly evaluating tr /3, 7 )] using the Wigner-d matrix 


tr 


m =—1 
1 

m=—l 

= cos(/3) (1 + cos(q; + 7 )) + cos(a + 7 ). 

This straightfoward truncation at t yields 

t 

k=0 

which, again, inherits the undesirable property that this approximation can be negative for some 6. 
Recall that we circumvented this property in the 1 -dimension case by taking the first-order Cesaro 
mean of the Dirichlet kernel. In the 2-dimension case, we will need the second-order Cesaro mean. 
Notice that 

sin [{2k -h 1 ) 5 ] 


Dk{9) = 


sm 




Fejer proved that [1] 




A :=0 


{t-k)\ 


{2k + 1)2 


>0, 0 < 6» < vr 


where = ^{t — k + 2){t — k + 1). It follows that 


Let us define 




fc =0 

t 


{t-k)\ 


(3)t_fc / 1\ sin 1(2A:-I-l)fl 

Y A ( fe + R 1 --7r<0<7r. 


fc =0 


{t-k)\ 


sm 


Ft{g) = Ft{a, /3, 7 ) := Y + 


k=0 


l\ sm 
2 


{2k+ir-i 


sm 


where 9g = 2 arccos 


cos cos ■ ^Iso, let us define 


Bt ■■= / Ft{g)dg, 

JsO{3) 

where dg is the Haar measure. 

We replace constraint ( |3.9[ ) with 

■^Ft{a,/3,-f) > 0 V(a,/ 3 , 7 ) E [0,27r] x [0,7r] x [0,27r]. 

Ft 

Secondly, we discretize the group 50(3) to obtain a finite number of constraints. We consider 
a suitable finite subset lit C 50(3). We can then relax the non-negativity constraint yielding the 
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following semidefinite progranQ 

^tr 


minimize 

xw 


fc =0 

subject to ^ 0 


— J, 

1 {3)t-k 

x.M = 1. 


(4.2) 


k + -]ti 


(W^'^\a,/3,^)y 


X, 


(k) 


> 0 V(a,/3,7) G Qt 


5. Applications 


In this section, we estimate the solution to registration in 1-dimension using (4.1), and the 


solutions to registration in 2-dimensions and orientation estimation in cryo-EM using (4.2). For 
each problem, the only parameters we need to determine are the coefficient matrices Since 

CC^) = (^fij{k)y then it suffices to calculate the Fourier coefficients fij{k) for the respective 
problems. 


5.1. Registration in 1-dimension. Recall that X is the space of bandlimited functions up to 
degree t on S^. That is, for x G A, we can express 

t 

x{(jj) = ^ 
i=-t 

Again, the irreducible representations of SO{2) are and = 1 for all k. Let us identify 

g G 50(2) with 9g G [0, 27r], then 

t t 

g ■ x{oj) = Y, 

l = — t l = — t 

Let V sample the underlying signal x at L = 2t -|- 1 distinct points. This way, we can determine all 
the q:;’s associated with x. 

Since yt = V{gi ■ x) + ei, then we can approximate yi with the expansion 

t 

l = -t 


Let us identify gig - ^ with 9ij G [0, 27r] 


Then, we can express fij in terms of 


(i) 


(i) 

a I and 9ij: 



= WVi - 9ij ■ Vj 


||2 

lb 



^Similarly to SO{2), it is possible that the non-negativity constraint may be replaced by and SDP or sums-of- 
squares constraint. 
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5.2. Registration in 2-dimension. Recall that X is the space of bandlimited functions up to 
degree t on S'^. That is, for x G Tf, we can express 

t i 

x{u) = X] X] 

1=0 rn=—l 

where {him} are the spherical harmonics. Again, the irreducible representations of 5'0(3) on are 
the Wigner-D matrices /3, 7)}, and dk = 2k + 1. Let us associate g G 50(3) with Euler 

(Z-Y-Z) angle (a,/?,7) G [0,27r] x [0,7r] x [0, 27r], then 

t i 

g ■ x{u!) = Y^ lT'^J^,(a,/3,7)a«m'hlm(w). 

/=0 m,m'=—l 

Let V sample the underlying signal x at L = (t + 1)^ points. This way, we can determine all the 
o/m’s associated with x. 

Since y* = V{gi ■ x) + ei, then we can approximate yi with the expansion 

t i 

;=0 m=—l 

Let US identify gigj^ G 50(3) with Euler (Z-Y-Z) angle {ocij, fdij,'jij) G [0, 27r] x [0,7r] x [0,27r]. 
Then, we can express fij in terms of and {aij, I3ij,'yij): 

fij {diY) = Vi- 9igJ^ • Vj 2 

= Z] + \4if) - 2 Z Z 

/=0 m=—l 1=0 m,m'=—l 
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The Fourier coefficients are given by 
t i 


Mk) = 


IS0{3) 


EE 

/=0 m=—l 






^E E 

/=0 m^m'=—l 


U) 

Im' 




. 8772 
'2/0 + 1 


2^1=0 2^m=-l y\^lm\ ^ J ^“OO “OO ’ /I — U 

, /c / 0 




m,m'=—k 

Here, we used the orthogonality relationship 


/ »'+(a,/3.7)+‘2(«.)9.7)<i9= 51^4,1-4,pi. 

Jsois) 2/C + i 


5.3. Orientation estimation in cryo-EM. We refer to 
projection Jj can be expanded via Fourier-Bessel series as 


k=—oo q=l 


2 k+ 1 

to expand the objective function; 


where 




iNkgJkiRkS^""' > 

lo 


— 

r > c. 


The parameters above are dehned as follows: 

• c is the radius of the disc containing the support of T (recall cf) ^ X has compact support). 

• Jk is the Bessel function of integer order /c, 

• Rkq is the root of Jk, 

• Xko = — ^1 r ^ / D —TT is a normalization factor. 

''V Cv+|Jfe + l(Rfe5)| 

We can approximate each Fourier-Bessel expansion by truncating, he., 


Pk 




k — — fcmax q — 1 

See [33] for a discussion on fcmax and pk- For the purpose of this section, let us assume we have 
{“fcg • ~kma.x < k < /cmaxj 1 < ^ < Pk} for each Ij. (These can be computed from data.) 

We shall determine the relationship between Ii{r,Bi) and Ij{r,Bj), and the lines of intersection 
between g~^ ■ li and g~^ ■ Ij embedded in Mf. Recall from ( 2.11| ) and (2.12) that the directions of 
the lines of intersection between g~^ ■ li and g~^ ■ Ij are given, respectively, by unit vectors 


-1 


Cij [giQj I — 


es X gig - ^ ■ 63 
les X gtg^^ ■ 63] 




■ji \ 9i9j ) — 


es X igiPj 


,-i 


-1 


• 63 


+3 X [gig 




-1 


• 63 
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Let US associate gig^ G 50(3) with Euler (Z-Y-Z) angle {aij, Pij, ^yij) G [0,27r] x [0,7r] x [0,27r]. 
Then 

— sin aij sin Pij 

63 X gig~^ • 63 = - cos aij sm(3ij , 


1 sin jij sin (3ij 

63 X - 63 = cos Jij sin f3ij 

0 


The directions of the lines of intersection in li and Ij under gig^ ^ are in the directions, respectively, 


COSQ!,-i \ TT 


COSJij\ TT 

= arctan ^^ = - - 7 ^ •. 

\sm-/ij J 2 


(0 f?) 

We express the fij^s in terms of , and 6i and 9j: 


^j) • — fij (didj 


^max Pk 


£ ("2dg " “2dg )) 

^max j^2 

Y. cN„Ny,, - age'".) (a« .e''-''"- - 


k,k' 


For each k, k', q, q', we approximate the integral 


Jk {Rkq'^) Jk (.^k'q'^^dr 


Jk{Rkqr)J'k{Rk'q'r)dr. 


with a Gaussian quadrature using the roots of P 2 s{y/x) and weights Here, P 2 s is the Legendre 
polynomial and we specify a suitable s. 

Using the approximation above, we have 

ftjiOi,ej) FS ^ bk,k',q,q' 

k,k',q,q' \ 


where 


^kq^k'q'^ 


^kq ^k'q'^ 


bk,k',q,q' cNkqNk'q' ^ ^ 


Jk{RkqXi')Jk{Rk'q'Xi') 


and Xj’s are the roots of P 2 s{\/x). 
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In terms of the Euler (Z-Y-Z) angles, 


' E 

k,k',q,q' \ 




The Fourier coefficients are by 


kik)= / h,{a,^)W^'^\a,P,j)dg 


iij 

ISO(3) 

p2TV r27v 

'o Jo 


fijia,^) ( / {a, P,^) sin pdp ] dadj 


'0 


The {m,m'Y^ entry of fij{k) is approximated by 

r2n r2n 


(/t 


m,m' Jq Jq 




= 2«>™L.{»/2) 


/o 

/*27r /*27r 

/o Jo 


dad-f 

/ij(a,7)e-‘”^“e-™'^dad7 


X!/ ^^142,91,92 ( ®iigi®i2g2'^0,m'5o,fci-fc2+m 

ki,k 2 ,qi,q 2 V 


(j*) (?) C S' 

■*"“fcigi'^fc2g2°0,m'20,A:i-fc2+m' 


(j) (i) r X 

““fciqi“fc2q2°^l’-m'0fc2,-r 


(i) (») X X 

“fcigi “^292 -mOk2-m' 


Here, is the Wigner-d matrix, 6 is the Kronecker delta and ^^ 1 ,^ 2 , 91,92 absorbed ^ 2 ) 


6. Conclusion 

This short manuscript is a preliminary version of a future publication with the same title and 
same authors. In particular, we defer an extensive numerical study of this approach to the future 
publication. It is worth mentioning that preliminary numerical simulations suggest that this ap¬ 
proach has a tendency to be tight, meaning that the solution to the relaxation often coincides with 
the solution to the original NUG problem under certain nonadversarial noise models. This tendency 
for certain semidefinite relaxations was conjectured in [9] and established in [ 6 ] for a particularly 
simple instance of angular synchronization. 

In many applications, such as the structure from motion problem in Computer Vision MM, 
the group G is non-compact. However, such groups can often be compactified by mapping a subset 
of ^ to a compact group. One example is to treat the group of Euclidean motions in d dimensions 
by mapping a bounded subset to S'0(d -|- 1) (see [27] for a description of the case d = 1). 

Acknowledgements. The authors would like to thank Moses Charikar and Peter Sarnak for many 
insightful discussions on the topic of this paper. 
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